Graph Tree Decomposition Based Fast Peptide Sequencing and Spectral Alignment
نویسندگان
چکیده
De novo sequencing and spectral alignment are computationally important for the prediction of new peptides via tandem mass spectrometry (MS/MS). Both approaches are established upon the technique of finding the longest antisymmetric path on formulated graphs. The task is often complicated and the prediction accuracy is compromised when given spectra involve noise data, missing mass peaks, or post translational modifications/mutations. This paper introduces a graphical mechanism to describe relationships among mass peaks that, through 1 The preliminary version of this paper appeared in the proceedings of the 11th Pacific Symposium on Biocomputing (PSB2006). * Corresponding Author. Email: [email protected]. International Journal of Computational Science 1992-6669 (Print) 1992-6677 (Online) www.gip.hk/ijcs © 2008 Global Information Publisher (H.K) Co., Ltd. All rights reserved. Graph Tree Decomposition Based Fast Peptide Sequencing and Spectral Alignment GLOBAL INFORMATION PUBLISHER 1 graph tree decomposition, yields linear time and quadratic algorithms for optimal de novo sequencing and spectral alignment respectively. Our test results show that, in addition to high efficiency, the new algorithms can achieve desired prediction accuracy on spectra containing noise peaks and post translational modifications (PTMs) while allowing the presence of both b-ions and y-ions.
منابع مشابه
Fast De novo Peptide Sequencing and Spectral Alignment via Tree Decomposition
De novo sequencing and spectral alignment are computationally important for the prediction of new protein peptides via tandem mass spectrometry (MS/MS). Both approaches are established upon the problem of finding the longest antisymmetric path on formulated graphs. The problem is of high computational complexity and the prediction accuracy is compromised when given spectra involve noisy data, m...
متن کاملFast and accurate search for non-coding RNA pseudoknot structures in genomes
MOTIVATION Searching genomes for non-coding RNAs (ncRNAs) by their secondary structure has become an important goal for bioinformatics. For pseudoknot-free structures, ncRNA search can be effective based on the covariance model and CYK-type dynamic programming. However, the computational difficulty in aligning an RNA sequence to a pseudoknot has prohibited fast and accurate search of arbitrary ...
متن کاملgpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences
Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...
متن کاملEfficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
Computational alignment of a biopolymer sequence (e.g., an RNA or a protein) to a structure is an effective approach to predict and search for the structure of new sequences. To identify the structure of remote homologs, the structure-sequence alignment has to consider not only sequence similarity but also spatially conserved conformations caused by residue interactions, and consequently is com...
متن کاملCross-Lingual Word Representations via Spectral Graph Embeddings
Cross-lingual word embeddings are used for cross-lingual information retrieval or domain adaptations. In this paper, we extend Eigenwords, spectral monolingual word embeddings based on canonical correlation analysis (CCA), to crosslingual settings with sentence-alignment. For incorporating cross-lingual information, CCA is replaced with its generalization based on the spectral graph embeddings....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008